Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 36733 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 2 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 2.5 MiB |
| Average record size in memory | 72.0 B |
Variable types
| Numeric | 9 |
|---|
| Dataset has 2 (< 0.1%) duplicate rows | Duplicates |
AFDP is highly correlated with TIT and 3 other fields | High correlation |
TIT is highly correlated with AFDP and 3 other fields | High correlation |
GTEP is highly correlated with AFDP and 3 other fields | High correlation |
CDP is highly correlated with AFDP and 3 other fields | High correlation |
TEY is highly correlated with AFDP and 3 other fields | High correlation |
AFDP is highly correlated with TIT and 3 other fields | High correlation |
TIT is highly correlated with AFDP and 3 other fields | High correlation |
GTEP is highly correlated with AFDP and 4 other fields | High correlation |
CDP is highly correlated with AFDP and 4 other fields | High correlation |
TAT is highly correlated with GTEP and 2 other fields | High correlation |
TEY is highly correlated with AFDP and 4 other fields | High correlation |
AFDP is highly correlated with TIT and 2 other fields | High correlation |
TIT is highly correlated with AFDP and 3 other fields | High correlation |
GTEP is highly correlated with AFDP and 3 other fields | High correlation |
CDP is highly correlated with AFDP and 3 other fields | High correlation |
TEY is highly correlated with TIT and 2 other fields | High correlation |
AH is highly correlated with AT | High correlation |
AT is highly correlated with AH and 5 other fields | High correlation |
AFDP is highly correlated with TIT and 4 other fields | High correlation |
TIT is highly correlated with AFDP and 4 other fields | High correlation |
GTEP is highly correlated with AT and 5 other fields | High correlation |
AP is highly correlated with AT | High correlation |
CDP is highly correlated with AT and 5 other fields | High correlation |
TAT is highly correlated with AT and 5 other fields | High correlation |
TEY is highly correlated with AT and 5 other fields | High correlation |
Reproduction
| Analysis started | 2022-03-04 21:42:27.545200 |
|---|---|
| Analysis finished | 2022-03-04 21:42:39.048605 |
| Duration | 11.5 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 25708 |
|---|---|
| Distinct (%) | 70.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.86701549 |
| Minimum | 24.085 |
|---|---|
| Maximum | 100.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 24.085 |
|---|---|
| 5-th percentile | 50.7772 |
| Q1 | 68.188 |
| median | 80.47 |
| Q3 | 89.376 |
| 95-th percentile | 97.3934 |
| Maximum | 100.2 |
| Range | 76.115 |
| Interquartile range (IQR) | 21.188 |
Descriptive statistics
| Standard deviation | 14.46135495 |
|---|---|
| Coefficient of variation (CV) | 0.1857186237 |
| Kurtosis | -0.2745902897 |
| Mean | 77.86701549 |
| Median Absolute Deviation (MAD) | 10.164 |
| Skewness | -0.6280340401 |
| Sum | 2860289.08 |
| Variance | 209.1307869 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 100.12 | 46 | 0.1% |
| 100.14 | 46 | 0.1% |
| 100.16 | 42 | 0.1% |
| 100.15 | 42 | 0.1% |
| 100.11 | 38 | 0.1% |
| 100.09 | 34 | 0.1% |
| 100.13 | 33 | 0.1% |
| 100.1 | 27 | 0.1% |
| 100.17 | 27 | 0.1% |
| 100.06 | 25 | 0.1% |
| Other values (25698) | 36373 |
| Value | Count | Frequency (%) |
| 24.085 | 1 | |
| 24.666 | 1 | |
| 25.987 | 1 | |
| 26.615 | 1 | |
| 27.504 | 1 | |
| 29.27 | 1 | |
| 29.316 | 1 | |
| 29.434 | 1 | |
| 29.475 | 1 | |
| 29.551 | 1 |
| Value | Count | Frequency (%) |
| 100.2 | 4 | < 0.1% |
| 100.19 | 1 | < 0.1% |
| 100.18 | 5 | < 0.1% |
| 100.17 | 27 | |
| 100.16 | 42 | |
| 100.15 | 42 | |
| 100.14 | 46 | |
| 100.13 | 33 | |
| 100.12 | 46 | |
| 100.11 | 38 |
| Distinct | 22523 |
|---|---|
| Distinct (%) | 61.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.71272625 |
| Minimum | -6.2348 |
|---|---|
| Maximum | 37.103 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 62 |
| Negative (%) | 0.2% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | -6.2348 |
|---|---|
| 5-th percentile | 5.75584 |
| Q1 | 11.781 |
| median | 17.801 |
| Q3 | 23.665 |
| 95-th percentile | 29.4848 |
| Maximum | 37.103 |
| Range | 43.3378 |
| Interquartile range (IQR) | 11.884 |
Descriptive statistics
| Standard deviation | 7.447451235 |
|---|---|
| Coefficient of variation (CV) | 0.4204576488 |
| Kurtosis | -0.8265999421 |
| Mean | 17.71272625 |
| Median Absolute Deviation (MAD) | 5.945 |
| Skewness | -0.04354672221 |
| Sum | 650641.5735 |
| Variance | 55.46452989 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 18.525 | 8 | < 0.1% |
| 23.969 | 8 | < 0.1% |
| 11.92 | 8 | < 0.1% |
| 10.925 | 7 | < 0.1% |
| 25.597 | 7 | < 0.1% |
| 20.752 | 7 | < 0.1% |
| 20.72 | 7 | < 0.1% |
| 18.431 | 7 | < 0.1% |
| 12.603 | 7 | < 0.1% |
| 16.792 | 7 | < 0.1% |
| Other values (22513) | 36660 |
| Value | Count | Frequency (%) |
| -6.2348 | 1 | |
| -6.0421 | 1 | |
| -5.9793 | 1 | |
| -5.9031 | 1 | |
| -5.8956 | 1 | |
| -5.8847 | 1 | |
| -5.82 | 1 | |
| -5.8189 | 1 | |
| -5.785 | 1 | |
| -5.7711 | 1 |
| Value | Count | Frequency (%) |
| 37.103 | 1 | |
| 37.098 | 1 | |
| 36.264 | 1 | |
| 35.822 | 1 | |
| 35.461 | 1 | |
| 35.406 | 1 | |
| 35.395 | 1 | |
| 35.21 | 1 | |
| 35.161 | 1 | |
| 35.045 | 1 |
| Distinct | 20495 |
|---|---|
| Distinct (%) | 55.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.925517714 |
| Minimum | 2.0874 |
|---|---|
| Maximum | 7.6106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 2.0874 |
|---|---|
| 5-th percentile | 2.6666 |
| Q1 | 3.3556 |
| median | 3.9377 |
| Q3 | 4.3769 |
| 95-th percentile | 5.31142 |
| Maximum | 7.6106 |
| Range | 5.5232 |
| Interquartile range (IQR) | 1.0213 |
Descriptive statistics
| Standard deviation | 0.7739355929 |
|---|---|
| Coefficient of variation (CV) | 0.1971550377 |
| Kurtosis | 0.2246259001 |
| Mean | 3.925517714 |
| Median Absolute Deviation (MAD) | 0.4949 |
| Skewness | 0.381096574 |
| Sum | 144196.0422 |
| Variance | 0.598976302 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4.0076 | 9 | < 0.1% |
| 3.2115 | 9 | < 0.1% |
| 4.176 | 8 | < 0.1% |
| 4.1601 | 8 | < 0.1% |
| 3.7043 | 8 | < 0.1% |
| 3.7056 | 8 | < 0.1% |
| 4.1083 | 8 | < 0.1% |
| 4.25 | 8 | < 0.1% |
| 3.5297 | 8 | < 0.1% |
| 3.8733 | 8 | < 0.1% |
| Other values (20485) | 36651 |
| Value | Count | Frequency (%) |
| 2.0874 | 1 | |
| 2.0992 | 1 | |
| 2.1057 | 1 | |
| 2.1197 | 1 | |
| 2.1395 | 1 | |
| 2.1441 | 1 | |
| 2.1517 | 1 | |
| 2.1597 | 1 | |
| 2.1673 | 1 | |
| 2.185 | 1 |
| Value | Count | Frequency (%) |
| 7.6106 | 1 | |
| 7.5549 | 1 | |
| 7.3189 | 1 | |
| 7.2399 | 1 | |
| 6.9831 | 1 | |
| 6.9779 | 1 | |
| 6.956 | 1 | |
| 6.9312 | 1 | |
| 6.927 | 1 | |
| 6.9259 | 1 |
| Distinct | 799 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1081.428084 |
| Minimum | 1000.8 |
|---|---|
| Maximum | 1100.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 1000.8 |
|---|---|
| 5-th percentile | 1047.8 |
| Q1 | 1071.8 |
| median | 1085.9 |
| Q3 | 1097 |
| 95-th percentile | 1100.1 |
| Maximum | 1100.9 |
| Range | 100.1 |
| Interquartile range (IQR) | 25.2 |
Descriptive statistics
| Standard deviation | 17.53637294 |
|---|---|
| Coefficient of variation (CV) | 0.01621594001 |
| Kurtosis | -0.0457552994 |
| Mean | 1081.428084 |
| Median Absolute Deviation (MAD) | 12.9 |
| Skewness | -0.8882780436 |
| Sum | 39724097.8 |
| Variance | 307.5243757 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1100 | 2735 | 7.4% |
| 1099.9 | 2158 | 5.9% |
| 1100.1 | 1322 | 3.6% |
| 1099.8 | 870 | 2.4% |
| 1100.2 | 527 | 1.4% |
| 1099.7 | 324 | 0.9% |
| 1100.3 | 260 | 0.7% |
| 1099.6 | 186 | 0.5% |
| 1085.4 | 143 | 0.4% |
| 1086.5 | 137 | 0.4% |
| Other values (789) | 28071 |
| Value | Count | Frequency (%) |
| 1000.8 | 1 | |
| 1001.3 | 1 | |
| 1001.4 | 2 | |
| 1002.9 | 1 | |
| 1006.5 | 1 | |
| 1007.9 | 1 | |
| 1009 | 1 | |
| 1009.5 | 1 | |
| 1011.4 | 1 | |
| 1011.7 | 1 |
| Value | Count | Frequency (%) |
| 1100.9 | 1 | < 0.1% |
| 1100.8 | 1 | < 0.1% |
| 1100.7 | 1 | < 0.1% |
| 1100.6 | 3 | < 0.1% |
| 1100.5 | 15 | < 0.1% |
| 1100.4 | 87 | 0.2% |
| 1100.3 | 260 | 0.7% |
| 1100.2 | 527 | 1.4% |
| 1100.1 | 1322 | |
| 1100 | 2735 |
| Distinct | 12967 |
|---|---|
| Distinct (%) | 35.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.56380138 |
| Minimum | 17.698 |
|---|---|
| Maximum | 40.716 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 17.698 |
|---|---|
| 5-th percentile | 19.251 |
| Q1 | 23.129 |
| median | 25.104 |
| Q3 | 29.061 |
| 95-th percentile | 32.9 |
| Maximum | 40.716 |
| Range | 23.018 |
| Interquartile range (IQR) | 5.932 |
Descriptive statistics
| Standard deviation | 4.195957462 |
|---|---|
| Coefficient of variation (CV) | 0.1641366791 |
| Kurtosis | -0.6538527404 |
| Mean | 25.56380138 |
| Median Absolute Deviation (MAD) | 2.488 |
| Skewness | 0.3290213527 |
| Sum | 939035.116 |
| Variance | 17.60605903 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 24.308 | 18 | < 0.1% |
| 24.672 | 16 | < 0.1% |
| 25.463 | 16 | < 0.1% |
| 25.184 | 15 | < 0.1% |
| 25.106 | 15 | < 0.1% |
| 25.487 | 15 | < 0.1% |
| 25.557 | 14 | < 0.1% |
| 25.352 | 14 | < 0.1% |
| 25.443 | 14 | < 0.1% |
| 25.299 | 14 | < 0.1% |
| Other values (12957) | 36582 |
| Value | Count | Frequency (%) |
| 17.698 | 1 | |
| 17.719 | 1 | |
| 17.738 | 1 | |
| 17.741 | 1 | |
| 17.761 | 1 | |
| 17.826 | 1 | |
| 17.857 | 2 | |
| 17.862 | 1 | |
| 17.878 | 2 | |
| 17.912 | 1 |
| Value | Count | Frequency (%) |
| 40.716 | 1 | |
| 40.106 | 1 | |
| 39.37 | 1 | |
| 38.922 | 1 | |
| 38.362 | 1 | |
| 38.171 | 1 | |
| 38.051 | 1 | |
| 37.877 | 1 | |
| 37.873 | 1 | |
| 37.864 | 1 |
| Distinct | 791 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1013.070165 |
| Minimum | 985.85 |
|---|---|
| Maximum | 1036.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 985.85 |
|---|---|
| 5-th percentile | 1003.3 |
| Q1 | 1008.8 |
| median | 1012.6 |
| Q3 | 1017 |
| 95-th percentile | 1024.3 |
| Maximum | 1036.6 |
| Range | 50.75 |
| Interquartile range (IQR) | 8.2 |
Descriptive statistics
| Standard deviation | 6.463345955 |
|---|---|
| Coefficient of variation (CV) | 0.00637995884 |
| Kurtosis | 0.4419933185 |
| Mean | 1013.070165 |
| Median Absolute Deviation (MAD) | 4.1 |
| Skewness | 0.194121007 |
| Sum | 37213106.37 |
| Variance | 41.77484093 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1012.1 | 297 | 0.8% |
| 1010.8 | 288 | 0.8% |
| 1011.8 | 284 | 0.8% |
| 1011.9 | 284 | 0.8% |
| 1011.1 | 283 | 0.8% |
| 1012.2 | 281 | 0.8% |
| 1010.9 | 279 | 0.8% |
| 1012 | 276 | 0.8% |
| 1012.6 | 276 | 0.8% |
| 1012.7 | 275 | 0.7% |
| Other values (781) | 33910 |
| Value | Count | Frequency (%) |
| 985.85 | 1 | |
| 986.16 | 1 | |
| 986.25 | 1 | |
| 986.41 | 2 | |
| 986.43 | 1 | |
| 986.56 | 1 | |
| 986.78 | 1 | |
| 986.87 | 1 | |
| 987.31 | 1 | |
| 987.43 | 1 |
| Value | Count | Frequency (%) |
| 1036.6 | 1 | < 0.1% |
| 1036.5 | 2 | |
| 1036.4 | 2 | |
| 1036.3 | 4 | |
| 1036.2 | 1 | < 0.1% |
| 1036 | 1 | < 0.1% |
| 1035.8 | 3 | |
| 1035.7 | 2 | |
| 1035.6 | 2 | |
| 1035.5 | 2 |
| Distinct | 4447 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.06052515 |
| Minimum | 9.8518 |
|---|---|
| Maximum | 15.159 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 9.8518 |
|---|---|
| 5-th percentile | 10.385 |
| Q1 | 11.435 |
| median | 11.965 |
| Q3 | 12.855 |
| 95-th percentile | 13.989 |
| Maximum | 15.159 |
| Range | 5.3072 |
| Interquartile range (IQR) | 1.42 |
Descriptive statistics
| Standard deviation | 1.088795301 |
|---|---|
| Coefficient of variation (CV) | 0.09027760296 |
| Kurtosis | -0.6315875791 |
| Mean | 12.06052515 |
| Median Absolute Deviation (MAD) | 0.637 |
| Skewness | 0.2367915709 |
| Sum | 443019.2702 |
| Variance | 1.185475207 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11.891 | 55 | 0.1% |
| 11.872 | 47 | 0.1% |
| 11.908 | 46 | 0.1% |
| 11.902 | 45 | 0.1% |
| 11.899 | 44 | 0.1% |
| 11.901 | 43 | 0.1% |
| 11.839 | 43 | 0.1% |
| 11.835 | 43 | 0.1% |
| 11.916 | 43 | 0.1% |
| 11.937 | 41 | 0.1% |
| Other values (4437) | 36283 |
| Value | Count | Frequency (%) |
| 9.8518 | 1 | |
| 9.8708 | 1 | |
| 9.8754 | 1 | |
| 9.8806 | 1 | |
| 9.9044 | 1 | |
| 9.9046 | 1 | |
| 9.9178 | 1 | |
| 9.9239 | 1 | |
| 9.9244 | 1 | |
| 9.9286 | 1 |
| Value | Count | Frequency (%) |
| 15.159 | 1 | |
| 15.083 | 1 | |
| 15.081 | 1 | |
| 15.055 | 1 | |
| 15.043 | 1 | |
| 15.042 | 1 | |
| 15.039 | 1 | |
| 15.031 | 1 | |
| 15.029 | 1 | |
| 15.002 | 1 |
| Distinct | 2769 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 546.1585171 |
| Minimum | 511.04 |
|---|---|
| Maximum | 550.61 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 511.04 |
|---|---|
| 5-th percentile | 529.96 |
| Q1 | 544.72 |
| median | 549.88 |
| Q3 | 550.04 |
| 95-th percentile | 550.3 |
| Maximum | 550.61 |
| Range | 39.57 |
| Interquartile range (IQR) | 5.32 |
Descriptive statistics
| Standard deviation | 6.842360433 |
|---|---|
| Coefficient of variation (CV) | 0.01252815844 |
| Kurtosis | 2.016791705 |
| Mean | 546.1585171 |
| Median Absolute Deviation (MAD) | 0.26 |
| Skewness | -1.755907087 |
| Sum | 20062040.81 |
| Variance | 46.8178963 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 550.01 | 657 | 1.8% |
| 550 | 648 | 1.8% |
| 549.98 | 639 | 1.7% |
| 549.99 | 628 | 1.7% |
| 549.96 | 625 | 1.7% |
| 550.04 | 611 | 1.7% |
| 549.97 | 607 | 1.7% |
| 550.03 | 590 | 1.6% |
| 550.02 | 590 | 1.6% |
| 549.94 | 584 | 1.6% |
| Other values (2759) | 30554 |
| Value | Count | Frequency (%) |
| 511.04 | 1 | |
| 512.45 | 1 | |
| 512.6 | 2 | |
| 513.06 | 1 | |
| 513.09 | 1 | |
| 513.17 | 1 | |
| 513.29 | 1 | |
| 513.47 | 1 | |
| 513.75 | 1 | |
| 514.3 | 1 |
| Value | Count | Frequency (%) |
| 550.61 | 1 | < 0.1% |
| 550.6 | 1 | < 0.1% |
| 550.59 | 1 | < 0.1% |
| 550.57 | 2 | < 0.1% |
| 550.56 | 3 | < 0.1% |
| 550.55 | 4 | < 0.1% |
| 550.54 | 2 | < 0.1% |
| 550.53 | 5 | |
| 550.52 | 8 | |
| 550.51 | 11 |
| Distinct | 6236 |
|---|---|
| Distinct (%) | 17.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 133.5064035 |
| Minimum | 100.02 |
|---|---|
| Maximum | 179.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 287.1 KiB |
Quantile statistics
| Minimum | 100.02 |
|---|---|
| 5-th percentile | 109.03 |
| Q1 | 124.45 |
| median | 133.73 |
| Q3 | 144.08 |
| 95-th percentile | 161.33 |
| Maximum | 179.5 |
| Range | 79.48 |
| Interquartile range (IQR) | 19.63 |
Descriptive statistics
| Standard deviation | 15.61863437 |
|---|---|
| Coefficient of variation (CV) | 0.1169879044 |
| Kurtosis | -0.5001962549 |
| Mean | 133.5064035 |
| Median Absolute Deviation (MAD) | 9.76 |
| Skewness | 0.1165547708 |
| Sum | 4904090.72 |
| Variance | 243.9417396 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 133.78 | 185 | 0.5% |
| 133.74 | 174 | 0.5% |
| 133.76 | 168 | 0.5% |
| 133.67 | 163 | 0.4% |
| 133.79 | 149 | 0.4% |
| 133.72 | 145 | 0.4% |
| 133.75 | 141 | 0.4% |
| 133.73 | 140 | 0.4% |
| 133.77 | 136 | 0.4% |
| 133.68 | 135 | 0.4% |
| Other values (6226) | 35197 |
| Value | Count | Frequency (%) |
| 100.02 | 1 | |
| 100.03 | 1 | |
| 100.04 | 1 | |
| 100.07 | 1 | |
| 100.14 | 1 | |
| 100.17 | 1 | |
| 100.2 | 2 | |
| 100.22 | 1 | |
| 100.32 | 1 | |
| 100.36 | 1 |
| Value | Count | Frequency (%) |
| 179.5 | 1 | |
| 178.31 | 1 | |
| 177.91 | 1 | |
| 177.88 | 1 | |
| 177.49 | 1 | |
| 176.91 | 1 | |
| 176.71 | 1 | |
| 176.55 | 1 | |
| 176.35 | 1 | |
| 176.25 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| AH | AT | AFDP | TIT | GTEP | AP | CDP | TAT | TEY | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 90.262 | 9.3779 | 2.3927 | 1043.6 | 19.166 | 1020.1 | 10.564 | 541.16 | 110.16 |
| 1 | 89.934 | 9.2985 | 2.3732 | 1039.9 | 19.119 | 1019.9 | 10.572 | 538.94 | 109.23 |
| 2 | 89.868 | 9.1337 | 2.3854 | 1041.0 | 19.178 | 1019.8 | 10.543 | 539.47 | 109.62 |
| 3 | 89.490 | 8.9715 | 2.3825 | 1037.1 | 19.180 | 1019.3 | 10.458 | 536.89 | 108.88 |
| 4 | 89.099 | 9.0157 | 2.4044 | 1043.5 | 19.206 | 1019.1 | 10.464 | 541.25 | 110.09 |
| 5 | 88.783 | 9.0465 | 2.3826 | 1042.5 | 19.304 | 1019.0 | 10.461 | 540.14 | 110.23 |
| 6 | 88.853 | 8.8649 | 2.4237 | 1043.5 | 19.269 | 1019.0 | 10.480 | 540.87 | 110.53 |
| 7 | 88.760 | 8.9862 | 2.4409 | 1043.7 | 19.446 | 1019.2 | 10.475 | 540.65 | 110.61 |
| 8 | 88.624 | 8.9956 | 2.3959 | 1037.3 | 19.217 | 1019.5 | 10.463 | 536.89 | 109.03 |
| 9 | 88.561 | 8.9836 | 2.4067 | 1039.7 | 19.140 | 1019.6 | 10.451 | 538.61 | 109.28 |
Last rows
| AH | AT | AFDP | TIT | GTEP | AP | CDP | TAT | TEY | |
|---|---|---|---|---|---|---|---|---|---|
| 36723 | 98.388 | 10.4540 | 3.5555 | 1053.4 | 18.937 | 1004.5 | 10.327 | 550.03 | 110.78 |
| 36724 | 99.282 | 10.3050 | 3.5339 | 1053.3 | 18.909 | 1004.6 | 10.328 | 550.00 | 110.78 |
| 36725 | 99.995 | 10.2380 | 3.8805 | 1067.5 | 21.206 | 1004.6 | 11.002 | 550.32 | 121.26 |
| 36726 | 100.170 | 10.3470 | 4.3198 | 1084.3 | 24.048 | 1004.9 | 11.685 | 549.98 | 133.74 |
| 36727 | 99.985 | 10.1550 | 3.7043 | 1059.7 | 19.837 | 1005.1 | 10.570 | 549.90 | 115.52 |
| 36728 | 98.460 | 9.0301 | 3.5421 | 1049.7 | 19.164 | 1005.6 | 10.400 | 546.21 | 111.61 |
| 36729 | 99.093 | 7.8879 | 3.5059 | 1046.3 | 19.414 | 1005.9 | 10.433 | 543.22 | 111.78 |
| 36730 | 99.496 | 7.2647 | 3.4770 | 1037.7 | 19.530 | 1006.3 | 10.483 | 537.32 | 110.19 |
| 36731 | 99.008 | 7.0060 | 3.4486 | 1043.2 | 19.377 | 1006.8 | 10.533 | 541.24 | 110.74 |
| 36732 | 97.533 | 6.9279 | 3.4275 | 1049.9 | 19.306 | 1007.2 | 10.583 | 545.85 | 111.58 |
Most frequently occurring
| AH | AT | AFDP | TIT | GTEP | AP | CDP | TAT | TEY | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 95.938 | 23.156 | 4.0547 | 1076.6 | 24.672 | 1004.2 | 11.835 | 549.87 | 127.01 | 5 |
| 0 | 87.328 | 26.067 | 5.0703 | 1099.1 | 29.984 | 1008.3 | 13.038 | 546.78 | 146.14 | 4 |